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A New Proof for the DoF Region of the MIMO 

Networks with No CSIT 

Borzoo Rassouli, Chenxi Hao and Bruno Clerckx 


Abstract — In this paper, a new proof for the degrees of 
freedom (DoF) region of the A'-user multiple-input multiple- 
output (MIMO) broadcast channel (BC) with no channel state 
information at the transmitter (CSIT) and perfect channel state 
information at the receivers (CSIR) is provided. Based on this 
proof, the capacity region of a certain class of MIMO BC with 
channel distribution information at the transmitter (CDIT) and 
perfect CSIR is derived. Finally, an outer bound for the DoF 
region of the A-user MIMO interference channel (IC) with no 
CSIT is provided. 

Index Terms — DoF, CSIT, MIMO broadcast channel 

I. Introduction 

Spatial multiplexing is a key feature of MIMO communi¬ 
cation networks (TJ. The DoF region, which is the capacity 
region normalized by the logarithm of SNR in high SNR 
regimes, is a metric that captures the spatial multiplexing 
property. The DoF region of the MIMO BC with no CSIT 
was first shown in 0, 0 for the two user case and later in 
0 for the general A'-user BC. 

In this paper, we provide an alternative proof for the results 
obtained in the mentioned papers based on a simple lemma 
where its advantage over 0 is in extending the results of 0 
for the capacity region of special two-user broadcast channels 
to special K-user BCs. The paper is organized as follows. 
Section G3] introduces the system model. The alternative proof 
is provided in section [III] Subsequently, The capacity region 
of a certain A'-user MIMO BC with CDIT and an outer bound 
for the DoF region of the MIMO IC with no CSIT are provided 
in section m and section m respectively. 

Throughout the paper, / ~ o(logP) is equivalent to 
limp^oo j -Jpp = 0 and for a pair of integers m < q, the 
discrete interval is defined as [to : q] = {to, to + 1,..., q}. 
Y\w = {Y u Y i+ Yj}, Y([i : j]) = {Y{i),Y{i + 
1 Y(j)} and Y n = Y([l : n ]). R >o denotes the set 
of non-negative real numbers. 

II. System Model and Main Results 

We consider a MIMO BC, in which a transmitter with 
M antennas sends independent messages IC'i,..., Wk to A' 
users (receivers), where each receiver is equipped with TV, 
receive antennas (* £ [1 : A']). In a flat fading scenario, the 
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discrete-time baseband received signal of user i at channel use 
(henceforth, time slot) t can be written as 

Yi(i) = Hf (i)X(t) + Z 4 (i) , * € [1 : AT] , t £ [1 : n] (1) 

where X(f) £ C Mxl is the transmitted signal satisfying 
the (per codeword) power constraint Y^t=\ ll x WI| 2 < nP. 
H,(t) £ (jMxNi anc j 2, : (t) £ C NiXl are, respectively, the 
channel matrix and the additive noise vector of receiver i. 
The elements of H,(t) are independent identically distributed 
across time and users. The noise vectors and the elements 
of the channel matrices are allowed to have any tempered 
distribution (independent of X(t)) and the channel matrices are 
assumed to be full rank almost surely. We assume no channel 
state information at the transmitter and perfect local channel 
state information at the receiver (CSIR) i.e., at time slot t, user 
i has perfect knowledge of Hj([l : t}). 

The rate tuple (Pi, R 2 , ■ ■., Rk ), in which Ri = log (U y <l) , 
is achievable if there exists a coding scheme such that the 
probability of error in decoding W{ at user i(i £ [1 : AT]) 
can be made arbitrarily small with sufficiently large coding 
block length. Analysis of the capacity region C(P), which is 
the closure of the set of achievable rate tuples, is not always 
tractable. Instead, we consider the DoF region, which is a sim¬ 
pler metric independent of the transmit power, and is defined 
as {(di,..., P 2 , •. •, Rk) £ C{P) such that di = 

limp^oo lQ gp Vz}. At very high SNRs, the effect of additive 
noise can be neglected and what remains is the interference 
caused by other users signals. Therefore, the DoF region could 
also be interpreted as the region constructed by the number 
of interference-free private data streams that users receive 
simultaneously per channel use. 

Theorem 1. The DoF region of the K-user MIMO BC with 
no CSIT and perfect CSIR is given by 

K d 

D = {(d 1 ,d 2 ,...,d K )&R% 0 (2) 

i=l 

where ri = min{M, Nf). 

III. Proof of the theorem 1 

Unlike 0 and 0, the proof is not based on the degrad¬ 
edness of the MIMO BC under no CSIT. Without loss of 
generality, we assume Ni > N 2 > ... > N /< and we enhance 
the channel by giving the messages of users \i + 1 : A'] to 
user i. We also assume that each user not only knows its own 
channel, but also has perfect knowledge of the other users’ 
channels. In other words, perfect global CSIR is assumed. It is 
obvious that this assumption does not reduce the outer bound 
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which means that the bound with local CSIR is inside the 
bound with global CSIR; however, the achievability is based 
on only local CSIR. The region is further enhanced by giving 
all the noise vectors to each user. According to the Fano’s 
inequality 

nRi < /(Wi;Y"|n n ,A n , W [i+1:K] )+ ne n , i € [1 : K] (3) 

where Wk +i = 0 and e n goes to zero as n goes to infinity, 
f l n is the global channel state information up to time slot n 
and A" denotes the set of all the noise vectors across the users 
(extended over n time slots). Let .S', denote the index set of 
the r,(= min{M, A/j}) linearly independent elements of the 
TV,-dimensional vector I+X (note that S t is not necessarily 
unique). We decompose the TV,;-dimensional received signal of 
user i as Y, = (Y,, Y,) where Y, corresponds to the set of r, 
linearly independent elements having their index in Si , i.e. 
Y, = Yj i 5 i , and Y, can be reconstructed by linear combination 
of the elements in Y, within noise level. From the chain rule 
of mutual information, 


nRi<I{Wi^\n n , A", W [i+1:K] ) 

+ I(Wi ; A", W [i+1:K] , Y") +ne n . (4) 

■-v-' 

o(logP) 

For simplicity, we ignore ne n (since later it will be divided 
by n and n — > oo) and the term with o(log P) and write 


y .nRi A /(IL,;Y"|fr,A^, +1:J + 

i=i z=i 


K-l 


< | y 

tk +' 


h(Y?\n n ,A n ,W [i+1:K] ) 


n 


<n log P 

h(Y? +1 \n n ,A n ,W [i+1:K] ) 


Ti +1 


(5) 


where we have used the fact that /l ( Yi l n - A W[i -k]) ^ 

r i 

o(logP), since with the knowledge of A", W^-k], the 
observation Y™ can be reconstructed. Before going further, 
the following lemma is needed. The authors in a prove 
the following lemma in a combinatorial theoretic approach, 
while our proof is based on induction and simple properties 
of entropy. 

Lemma. Let = {+, Y 2 , ..., Yjv} be a set of TV(> 2) 
arbitrary random variables and + (F^) be a sliding window 
of size j over (1 < i,j < N) starting from Y t i.e., 


( r + — Y(j_i) w+ i, • • •, Y(i+j- 2)jv+i 


where (.+ defines the modulo N operation. Then, 


N 

(N ~m)h(Y [1:N] \A) <J2h(^- m (T N )\A) (6) 

»= 1 

where m £ [1 : N — 1] and A is an arbitrary random variable. 


Proof: We prove the lemma by showing that for every 
fixed m(> 1), © holds for all TV(> m + 1) using induction. 
It is obvious that for every m(> 1), © holds for TV = m +1. 
In other words, /i(Yi, Y 2 ,..., Y^\A) < YliLi h(Yi\A). Now, 


considering that © is valid for TV(> m + 1), we show that it 
also holds for N + 1. Replacing N with N + 1, we have 


(TV + 1 - m)h(Y [1:N+1] \A) 

= M^+tv+i] |A) +(TV — m)/i(Yji : jv-i], Vjv, Tjv+i | A) 

N 


< h(Y [1;N+1] \A) + E h(*?- m (<f> N )\A) 
1=1 
m 

= h(Y [1:N+1] \A) + J2 h(^~ m ^ N )\A) 


(7) 


N 


2=1 
-.N+l—mr 


+ E Ti(4'f +i_m (r Ar+ i)| J 4) 

i=m-\-l 

m+l:N] I^AT+1 1 ^\l:N—m] 5 A) 


( 8 ) 


+ E + h(Y N+1 ,Y [1:N _ m] \A) 


2=1 

N 


(9) 


+ E h(^ +1 ~ m (T N+1 )\A) 

i=m-\-1 

= h{Y[N—m+l:N] K AT+ 1 5 Y[l:N-m] 5 A) 
m N+l 

+ J2h(^~ m ^ N )\A)+ J2 h(^ +1 ~ m (T N+1 )\A) 

2=1 2=m+l 

m 

= E^ —m +2 I^AT+1, yjl;iV-m+2-l]? A) 


2=1 


N+l 


+ E>(Yi i:JV - ro+i _ 1]\A)+ E K*? +1 - m (r N+1 )\A) 

2=1 i=m+l 

( 10 ) 

m m 

<E^ —m+ 2 1 ^[ 2 : AT— m+ 2 — 1] 5 ++Em*i i:N— m+2— 1] 1+ 


N+l 


+ E h (** 


iV+l—m 


Fn+i)\A) 


( 11 ) 


i=m +1 


AT+1 


= E H^ +1 ~ m (XN+i)\A) + E M^f +1 “ m (riv + i)|+ 


2=1 

N+l 


i=m+1 


= E^ 


N+l—m 


(rjv +1 )|+ 


( 12 ) 


i=i 


where in ©. = {Y^. N _-^,Z} and we have used the 

validity of © for TV. In ©, we have used the fact that 
^ +1 ~ m (r N+1 ) = + v - m ($ Ar ) for i g [m+ 1 : TV] . In 
©, the chain rule of entropies is used and in (ITOt . the sliding 
window is written in terms of its elements. Finally, in (ITTl) . the 
fact that conditioning does not increase the differential entropy 
is used. Therefore, since m(> 1) was chosen arbitrarily and 
© is valid for TV = m+1 and from its validity for TV(> m+1) 
we could show it also holds for TV + 1, we conclude that © 
holds for all values of m and TV satisfying 1 < m < TV — 1. 
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Each term in the summation of (0 can be written as 

r l+1 h{\"\Si,n)~r x h{\"+i^u) 


GG+1 


r i+1 h(Y? 1 ,Y? 2 ,.. 


,!$,„) nh( Y? +1 |5i,„) 


< 


Ep =1 K% 


nn + 1 

; i+1 (r ri )|5 iin ) 


ri 

E 

p=1 

|"M® 


ri 

E 

p=1 

[MA. 



nn+t 

(r.Jl^n) 


Vi+l 

V 


r i r i+l 

nhW +l \S h n) 

GG+i 

hPi+i\Si, n ) 


GG+1 GG+1 

i , n X"+B p , I ,„|5 i ,„) /i(Ci, n X" 


D 


(13) 

(14) 

(15) 

(16) 

.| 5 i)n ) 


GG+1 


TiTi+l 


= 0 


(17) 

(18) 


where 5», n = {fi ra , A™, tE[ i+1: ^]} and in (fl5l >. since n +1 < 
Ti, the result of the previous lemma is applied in which 


— /v™ v n 

l 1 i,lJ 1 z,2; ' 


, Y" r .} is the set of r, linearly indepen¬ 


dent elements in Y". In (fTTb . we write 'kp‘ +1 (r r J and Y" +1 


as large nr-j+i dimensional vectors as follows. d/p ,+1 (r ri ) = 
TB Pj j ;n and Y ? _|_-^ = Cj ?n X +Dj jn where A P; 2 jn and 
C iin (£ (j nri+lXnM ) capture the channel coefficients over the 
n time slots, X" is the nM dimensional input vector and B pin 
and D, n capture the noise vectors over the n time slots. Since 
A p i n and Cj n are identically distributed channel coefficients 
and the noise terms are provided at each user, the arguments 
of the differential entropies in (fT71 > are statistically equivalent 
(i.e., have the same probability density function) which results 
in dH. Therefore, 0 is simplified to 


riR , 

- < n log P. 


i=l 


Vi 


(19) 


After dividing both sides by n log P and taking the limit 
n,P —> oo, we get 

E^ 1 - ( 20 ) 

i ' & 

2=1 

The above DoF region is achieved by a simple time sharing 
across the users where only local CSIR assumption is neces¬ 
sary. 

Note that since the noise can be non-Gaussian, Gaussian 
codes may no longer be DoF-achieving. Finally, it can be 
observed that the assumption of independent channels across 
the users was not used in the proof and since it does not 
change the achievability, it is not a necessary condition and 
can be relaxed. 

IV. Capacity region analysis 
In this section we consider i.i.d. Gaussian channels and 
noise vectors. We also assume M > N\ > N 2 > ... > Nk 
which results in ry = Ni(i £ [1 : K ]) and therefore, Y ?; = Y". 
From Fano’s inequality, 

y, nRj ^ y. I(W i; Y?\n n ,W [i+1:K] ) 

G — “ G 

2=1 2=1 


< 


H YSrlfi”) h(Y™\n n ,W [1:K] ) 


tk 


r 1 


( 21 ) 


n log( 27 re) 


K -1 r 

E 


h(Y?\n n ,w [i+1:K] ) h(Y? +1 \n n ,w [i+1:K] )\ 


G+l 


<0 


( 22 ) 


where the last non-positive term is a result of the lemma in 
the previous section. From the above results, we get an outer 
bound for the achievable rate region as 

K Ha n\\x\\ir\ 

(23) 


“ G nr K 
2=1 


Therefore, an outer bound for the ergodic capacity region is 

y^ Ri < max £ X :tr(£ x )<p E [logdet(I rK + H^SxHjf)] 

“ G “ r K 

2=1 

(24) 

and since the channels have i.i.d. Gaussian elements, the 
optimal input covariance matrix is jj I m 0. Hence, 

C°(P) = {(R 1 ,R 2 ,...,RK)€Rg 0 \ 


Ri < E 

K 


P , 


log det(I ri + — Hf Hi) 


Vi 


yR t ^ E [log det(I rg + ^HgH A -)] ^ 

.. ri ~ tk 

2=1 

It is obvious that the outer bound is more affected by the 
capacity of the point-to-point link from the transmitter to the 
user with the lowest number of receive antennas. 

Definition. We define a class of channels (a set of matrices) 
Q(p,q,m ) where each channel (matrix) in this class has its 
elements drawn from the distribution p in such a way that 
the optimal input covariance matrix for achieving the capacity 
of the point-to-point link from the transmitter to the virtual 
user defined by this channel is diagonal with equal entries. 
The details for this condition are given in 0 Exercise 8.6]. 
We also assume that for each channel in this class, all the 
singular values have the distribution q. In other words. 


Q(p,q,m) = {H £ C mx " \/ n < m | Elements of H ~ p, 

p 

arg max E [logdet(I„ + H h Y, x H)\ = — I m , 
£ X :tr(£ x )<P TO 

and Xi(H H H) ~ q,\/i = 1,... , rank(fT)} . (26) 


Theorem 2. In a K-user Gaussian MIMO BC with M > 
JVi > N 2 > ... > Nk and all the channels from the class of 
Q(p,q, M), the capacity region with CDIT is given by 


K 


lo g(! + ^A) 


(27) 


C(P) = \(R [1 .. K] )eR^ 0 \J2- 1 <E q 

■ 'i 

K i=l 

where E q [log(l + -jgA)] = /log(l + j^x)q(x)dx. 

Proof: According to (l24l > and the properties of 
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0(p, g, M), we have 


R * ^ ££ E Mi+ (28) 


Tk 


If the singular values of H k have the same distribution, we 
can write 


y — <E 

*-! n 
2—1 


loga + ^ArfH^Hx)) 


(29) 


Also, if the singular values have the same distribution across 
the users, the outer bound is easily achieved by orthogonal 
transmission strategies, and therefore it is the optimal capacity 
region. ■ 

A special case of theorem 2 was shown for the two user 
Gaussian MIMO BC in 0, in which all the eigenvalues of 
H^Hfc(fc = 1,2) are unity. 


V. MIMO INTERFERENCE CHANNEL WITH NO CSIT 


Consider a K -user MIMO IC with K transmitters and K 
receivers equipped with M t and N, antennas, respectively (i = 
1,2,..., K). The input-output relationship at channel use t is 
given by 


K 

Y i(t) = ^H5(f)X,(t) +Z i (t) ,i € [1 : K] ,t e [1 : n] 
3=1 

(30) 

where Y,(f) is the received signal at receiver i, H,. ; - is 
the channel matrix from the transmitter j to the receiver i, 
Xj (t) is the transmitted vector by the transmitter j satisfying 
e;uim*)ii 2 < nP and Zj(f) is the noise vector at the 
receiver i. We assume that the channels are drawn from the 
same distribution, while the noise vectors could have different 
distributions. We also assume perfect CSIR (each receiver 
knows all the incoming channels to it from all the transmitters) 
and no CSIT. 

For the two user case, theorems 2 and 3 in 0 are combined 
into theorem 5 in 0. Here, we provide an alternative proof 
for it. We assume Ni < N 2 and r, = min(M 2 , Ni). By giving 
the message of user 1 to user 2, we have 


nR 1 nR 2 ^ I(W\]Y 1 |fl n , A n ) I(W 2 - Y 2 |Q", A", Wi) 


n r 2 n 

h{Yi\n n ,A n ) 
n 


T2 


- o(logP) 


h(Y 2 |f2", A n , W\) h(Y 1 \Q n ,A n ,W 1 ) 


r 2 


n 


(31) 


nmm(N 1 ,M 1 +M 2 ) 

< -log P 

n 

+ r 1 h(n\^ n ^ n ^ 1 )-r 2 h{Y n l \^,A n \W 1 ) 

nr 2 


<0 


where Y 1 and Y 2 are the same as those in 0 and we have 
neglected all the terms with o(logP). In (l32l >. h(Y 1 |fl", A ra ) 


is maximized when Y” is Gaussian received from a transmitter 

with Mi + M 2 antennas. Also, in the term — — 1 1 1 — 

1 r 2 

/i(\ 1 |c^,A dG) j^ s j nce ,-jjg en t ro pies are conditioned on W±, 
the values of Xi(l),Xi(2),... ,Xi(n) are known. Therefore, 
the extensions of H^(f)Xi(f) and H^(f^Xi(f) overn channel 
uses can be removed from Y-, and Y 2 , respectively. What 
remains is a broadcast channel with a transmitter having M 2 
transmit antennas. With a difference of o(logP), we can 

~ n ~ n 

replace Y x and Y 2 with their linearly independent elements 

Y" and Y 2 , respectively as in 0. Since r t < r 2 , following 
the same approach as in the formulae ( IT3l ) to ( ITSi ). we get the 
non-positive term in ( l32l >. Therefore, the outer bound is 

D° = {(dr, d 2 ) e P> 0 | d t < min(Mj, 7V, ; ) * = 1,2 and 
<h + <h < min(W 1 ,Mi + M 2 ) 
r\ r 2 ~ n 

For the /\-user IC, following the same proof in this paper for 

the broadcast channel, an outer bound for the DoF region can 
be obtained if the transmitters cooperate to make a broadcast 
channel (with Mt = E, 3p). Therefore, 

D° = {(di, d 2 ,..., dx) S P> 0 | 

K d 

di < min (Mi,Ni) V* and ^ 1 < 1}. 

“ mm(M T ,A i ) 

(34) 

According to theorem 9 in 0, the above outer bound is tight 
provided that either Ni < Mi V* or Ni = TV > M = Mi V* 
where in the former time sharing across the users and in the 
latter receive zero-forcing and time sharing are the achievable 
schemes, respectively. 

VI. Conclusion 

In this paper, a new proof for the DoF region of the /T-user 
MIMO BC with no CSIT was provided. The advantage of this 
proof is in finding the capacity region of a specific class of 
the A'-user Gaussian MIMO BC with CDIT as explored in 
section [IV] Also, an outer bound for the DoF region of the 
A-user MIMO IC with no CSIT was provided. 
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